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Abstract. Competitive neural networks arc often used to model the dynamics of perceptual 
bistability. Switching between percepts can occur through fluctuations and/or a slow adaptive pro- 
cess. Here, we analyze switching statistics in competitive networks with short term synaptic depres- 
sion and noise. We start by analyzing a ring model that yields spatially structured solutions and 
complement this with a study of a space-free network whose populations are coupled with mutual 
inhibition. Dominance times arising from depression driven switching can be approximated using 
a separation of timescales in the ring and space-free model. For purely noise-driven switching, we 
use energy arguments to justify how dominance times are exponentially related to input strength. 
We also show that a combination of depression and noise generates realistic distributions of domi- 
nance times. Unimodal functions of dominance times are more easily differentiated from one another 
using Bayesian sampling, suggesting synaptic depression induced switching transfers more informa- 
tion about stimuli than noise-driven switching. Finally, we analyze a competitive network model 
of perceptual tristability, showing depression generates a memory of previous percepts based on the 
ordering of percepts. 
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1. Introduction. Ambiguous sensory stimuli with two interpretations can pro- 
duce perceptual rivalry [5] . For example, two orthogonal gratings presented to either 
eye lead to perception switching between one grating and then the next repetitively, 
a common paradigm known as binocular rivalry [31]. Perceptual rivalry can also be 
triggered by a single stimulus with two interpretations, like the Necker cube [39] . One 
notable feature of the switching process in perceptual rivalry is its stochasticity - a 
histogram of the dominance times of each percept spreads across a broad range [17]. 
Senses other than vision also exhibit perceptual rivalry. When two different odorants 
are presented to the two nostrils, a similar phenomenon occurs with olfaction, termed 
"binaral" rivalry [55]. Similar experiences have been evoked in the auditory [13, 41] 
and tactile [10] system. 

Multiple principles govern the relationship between the strength of percepts and 
the mean switching statistics in perceptual rivalry [32]. "Levclt's propositions" relate 
stimulus contrast to the mean dominance times: (i) increasing the contrast of one 
stimulus increases the proportion of time that stimulus is dominant; (ii) increasing 
the contrast of one stimulus does not affect its average dominance time; (iii) increas- 
ing contrast of one stimulus increases the rivalry alternation rate; and (iv) increasing 
the contrast of both stimuli increases the rivalry alternation rate. There are also 
relationships between the properties of the input and the stochastic variation in the 
dominance times [6]. For instance, the distribution of dominance times is well fit 
by a gamma distribution [17, 30, 47]. The fact that dominance times are not expo- 
nentially distributed suggests some background slow adaptive process plays a role in 
providing a nonzero peak in the dominance histograms [44] . Two commonly proposed 
mechanisms for this adaptation are spike frequency adaptation and short term synap- 
tic depression [29, 49, 43]. An even stronger case can be made of the existence of 
adaptation in perceptual processing networks by examining results of experiments on 
perceptual tristability [21]. Here, perception alternates between three possible choices 
and subsequent switches are determined by the previous switch [38]. This memory 
suggests switches in perceptual multistability are not purely noise-driven [37]. 

Most theoretical models of perceptual rivalry employ two pools of neurons, each 



selective to one percept, coupled to one another by mutual inhibition [33, 29, 43, 42]. 
With no other mechanisms at work, such architectures lead to winner-take- all states, 
where one pool of neurons inhibits the other indefinitely [48]. However, switches 
between the dominance of one pool and the other can be initiated with the inclusion 
of fluctuations [37] or an adaptive process [29, 43]. Combining the two effects leads to 
dominance times that are distributed according to the gamma distribution [29, 44, 47]. 
Slow adaptation and noise thus serve as agents for the sampling of the stimulus 
through network activity. A mutual inhibitory network would otherwise remain in 
the winner-take- all state indefinitely. 

In light of these observations, we wish to consider the role adaptive mechanisms 
play in properly sampling ambiguous stimuli in the context of a mutual inhibitory 
network. Purely fluctuation driven switching would provide a noisy sample of the two 
percepts, but pure adaptation would provide an extremely reliable sampling of percept 
contrast [44]. Thus, as the level of adaptation is increased and noise is decreased, one 
would expect that the ability of mutual inhibitory networks to encode information 
about ambiguous stimuli is vastly improved. A major point is that it is also vastly 
improved over networks without any adaptation at all. We focus specifically on short 
term synaptic depression [1, 45]. 

Using parametrized models, we will explore how synaptic depression improves the 
ability of a network to extract stimulus contrasts. First, we will be concerned with how 
much information can be determined about the contrast of each of the two percepts 
of an ambiguous stimulus. In the case of a winner-take-all solution, only information 
about a single percept could be known, since the pool of neurons encoding the other 
percept would be quiescent. We will study this problem using an anatomically moti- 
vated neural field model of an orientation column with synaptic depression [54, 25], 
given by (1). We find that increasing the strength of synaptic depression from zero 
leads to a bifurcation whereby rivalrous oscillations onset. When rivalrous switching 
occurs through a combination of depression and noise, we show stronger depression 
improves the transfer of information using simple Bayesian inference [24]. We also 
analyze a competitive network model with depression and noise (6) to help study the 
combined eff'ects of noise and depression on perceptual switching. In particular, we 
will show that the presence of synaptic depression increases the information relayed 
by the output of the network. Finally, we will show trimodal stimuli to a neural field 
model with synaptic depression can generate oscillations where each mode spends time 
in dominance. To deeply analyze the relative contributions of noise and depression to 
this switching process, we study a reduced model. This reveals depression generates 
a history dependence in switching that would not arise in the network with purely 
noise-driven switching. 

2. Materials and Methods. 

2.1. Ring model with synaptic depression. As a starting point, we will 
consider a model for processing the orientation of visual stimuli [3, 7] which also 
includes short term synaptic depression [54, 25]. Since GABAergic inhibition is much 
faster than AMPA-mediated excitation [23], we make the assumption that inhibition 
is slaved to excitation as in [2]. Reduction this disynaptic pathway and assuming 
depression acts on on excitation [45], we then have the model [54, 26] 



du{x,t) 
dt 



'Tt/2 



= -u{x,t) + 



w{x - y)q{y, t)f{u{y, t))dy + I{x) + ^{x, t), (la) 
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r = 1 - q{x, t) - Pq{x, t)f{u{x, t)). (lb) 

Here t) measures the synaptic input to the neural population with stimulus pref- 
erence X at time t, evolving on the timescale Tm- Firing rates are given by taking the 
gain function f{u) of the synaptic input, which we usually proscribe to be [51] 

l+e--r(«-«) ' 

and often take the high gain limit 7 — ^ oo for analytical convenience, so [2, 26] 

: w < K, 

U > K. ^ 
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External input, representing flow from upstream in the visual system is prescribed by 
the time-independent function I{x) [3, 7]. For the majority of our study of (1), we 
employ the bimodal stimulus 

I{x) = -lo cos(4a;) + sin(2a;), (4) 

representing stimuli at the two orthogonal angles — 7r/4 and 7r/4 and Iq controls the 
mean of each peak and la controls the level of asymmetry between the peaks. Ef- 
fects of noise are described by the stochastic processe (^(a;,t)) with (^(x,t)) = and 
{S,{x, t)£^{y, s)) = C{x — y)5{t — a). For simplicity, we assume the spatial correlations 
have a cosine profile C(x) = 7rcos(a;). Synaptic interactions are described by the inte- 
gral term, so w{x — y) describes the strength (amplitude of w) and net polarity (sign 
of w) of synaptic interactions from neurons with stimulus preference y to those with 
preference x. Following previous studies, we presume the modulation of the synaptic 
strength is given by the cosine 

w{x-y)=cos{2{x-y)), (5) 

so neurons with similar orientation preference excite one another and those with 
dissimilar orientation preference disynaptically inhibit one another [3, 15]. The factor 
q{x, t) measures of the fraction of available presynaptic resources, which are depleted 
at a rate /3/ [1, 45], and are recovered on a timescale specified by the time constant 

r[n]. 

By setting = 1, we can assume time evolves on units of the excitatory synaptic 
time constant, which we presume to be roughly 10ms [18]. Experimental observations 
have shown synaptic resources specified q are recovered on a timescale of 200-800ms 
[1, 46], so we require r is between 20 and 80, usually setting it to be r = 50. Our 
parameter /3 can then be varied independently to adjust the effective depletion rate 
of synaptic depression. 

2.2. Idealized competitive neural network. We also study space-free com- 
petitive neural networks with synaptic depression [43] . In this way, we can make more 
progress analyzing switching behavior. As a general model of networks connected by 
mutual inhibition, we consider the system [29, 37, 43] 

UR{t) = -ui{t) + f{lR - qL{t)uL{t)) + a(i), (6a) 

UL{t) = -U2{t) + fih - qR{t)uR{t)) + 6(i). (6b) 

TqR{t) = 1 - quit) - l3uR{t)qR{t), (6c) 

TqL {t) = l- qL {t) - PUL {t)qL (t) , (6d) 
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where Uj represents the firing rate of the j = 1,2 population. The strength of recurrent 
synaptic excitation within a population is specified by the parameter a, whereas the 
strength of cross-inhibition between populations is specified by (3. Fluctuations are 
introduced into population j with the independent white noise processes with 
{Xj{t)) = and MtMs)) = s6{t - s). 

2.3. Numerical simulation of stochastic differential equations. The spa- 
tially extended model (1) was simulated using an Euler-Maruyama method with a 
timestep dt = 10~^, using Riemann integration on the convolution term with N 
= 2000 spatial grid points. The space clamped competitive network (6) was also 
simulated using Euler-Maruyama with a timestep dt = 10~^. To generate histograms 
of dominance times, we simulated systems for 20000s (2 x 10^ time units). 

2.4. Fitting dominance time distributions. To generate the theoretical curves 

presented for exponentially distributed dominance times, we simply take the mean of 
dominance times and use it as the scaling in the exponential (40). For those densi- 
ties that we presume are gamma distributed, we solve a linear regression problem. 
Specifically, we look for the constants ci, C2, and C3 of 

/(T) = e^'T^^e-""'^ (7) 

an alternate form of (42). Upon taking the logarithm of (7), we have the linear sum 

In /(T) = ci+C2 In T-caT. (8) 

Then, we select three values of the numerically generated distribution p"(r") along 
with its associated dominance times: {T{',p1); {T:^,p^); m^Ps) where p] = p"(Tj^). 
We always choose = argmaxTp"(T) as well as = and = 2,T^ jl. It is 

then straightforward to solve the linear system 

1 InTi" -Ti" \ / Cl \ / Inp^' \ 
1 lnT2" -T^ C2 = Inp^ 

1 InTg" -T3" / V C3 / V Inp^ / 

for the associated constants using the \ command in MATLAB. 

3. Results. We now discuss several results that reveal the importance of synap- 
tic depression in transferring information about stimuli to competitive networks. This 
is initially shown by analyzing the ring model with depression (1). Ifowever, to carry 
out detailed analysis on stochastic switching a competitive network with depression, 
we must reduce (1) as well as analyzing an analogous model without space (6). 

3.1. Deterministic switching in the ring model. To start wc will consider 
the ring model with depression (1) in the absence of noise. We then have the deter- 
ministic system [54, 26, 27] 

.77/2 

utix,t) = -u{x,t) + w{x -y)q{y,t)f{u{y,t))dy + I{x), (9a) 

J-tt/2 

Tqt{x, t) = l- q{x, t) - /3q{x, t)f{u(x, t)). (9b) 

In previous work, versions of (9) have been analyzed to explore how synaptic de- 
pression can generate traveling pulses [54, 26], self-sustained oscillations [26], and 
spiral waves in two-dimensions [27]. Here, we will extend previous work that explored 
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Fig. 1. Three possible active states of the noise-free stimulus driven ring model with depression 
(9). (A) Winner take all (Ig = 0.6^; (B) Rivalrous oscillations (Iq = 0.84J; (C) Fusion (Iq = 1). 
Other parameters are = 0.5, 13 = 1, and t = 50. 



input-driven oscillations in two-layer networks like (9) that possessed many statistics 
matching binocular rivalry [25]. We will think of (9) as a model of monocular ri- 
valry, since oscillations can be due to competition between representations in a single 
orientation column [3]. Competition between ocular dominance columns [25] is not 
necessary for our theory. For the purpose of exposition, we will employ specific forms 
for the functions of (9): cosine weight (5); a Heaviside firing rate function (3); and a 
bimodal input (4). 

Winner take all state. We start by looking for winner-take-all solutions to the 
deterministic system (9), such as that shown in Fig. 1(A). These states consist of a 
single activity bump arising in the network, representing only one of the two percepts 
contained in the bimodal stimulus (4). These are stationary in time, so Ut = qt — 0, 
implying that u — U{x) and q — Q{x). Also, they are single bump solutions, so there 
will be a single region in space that is superthreshold [U [x) > k). We assume the right 
stimulus is represented by a bump, although we can derive analogous results when 
the left stimulus is represented. We then have a steady state solution determined by 

U{x)^ cos{2{x - y))Q{y)dy ~locos{4x)+ la sm{2x), (10) 



Q{x) ^ [l + (iH{U{x)-K)\ 



(11) 



since U{x) > k when x E (7r/4 — a,7r/4 -I- a), so that by plugging (11) into (10) and 
using the trigonometric identity cos(2(a; — y)) = cos(2a;) cos(22/) -I- sin(2x) sin(2j/) we 
have 

U{x) = 74cos(2a;) + B sm{2x) - !„ cos(4a;) + /„ sin(2x), (12) 
where the multiplicative constants A, B can be computed 

1 r^'^^" , ^ , „ 1 r^^^" , X , sin(2a) 
A= / cos(2a;)dx = 0, B = / sin(22:)dx = — ^ — -. 

Therefore, by simplifying the threshold condition, U^n/A ± a) = k, we have 



C/(7r/4 ± a) = 2^^^% ^ ^" <=os(4a) + la cos(2a) = k. 



(13) 
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The implicit equation (13) can be solved numerically using root finding algorithms. 
For symmetric inputs (/„ = 0), we can solve (13) explicitly 



tan 



1± Vl + 4(l+^)2(Jg 
2(1+/3)(7o + k) 



k2) 



Thus, we can fully characterize winner-take-all solutions 



U [x) = ^-^^^ sm.{2x) - lo cos(4a;) + la sin(2a;). 



(14) 



(15) 



The advantage of having this solution is that we can relate the parameters of the 
model to the existence of the winner-take-all state, where we would expect to only see 
single bump solutions. To do so, we need to look at a second condition that must be 
satisfied, U{x) < k for all x ^ (7r/4 — a, 7r/4 -|- a). Since the function (15) is bimodal 
across (—• 7r/2,7r/2), we check the other possible local maximum at a; = — 7r/4 as 



U{Tr/4) =Io-Ia 



sin(2a) 
1 + /3 



< K. 



(16) 



At the point in parameter space where the (16) is violated, a bifurcation occurs, so 
the winner-take-all state ceases to exist. This surface in parameter space is given by 
the equation 



Ia + 



sin(2a) 



(17) 



along with the explicit formula for the bump half- width (14). Beyond the bifurcation 
boundary (17), one of two behaviors can occur. Either there is a symmetric two-bump 
solution that exists, the fusion state [52, 4, 43], or rivalrous oscillations [32, 5]. 

Fusion state. It has been observed in many experiments on ambiguous stimuli 
that sufficiently strong contrast rivalrous stimuli can be perceived as a single fused im- 
age [4, 8] . This should not be surprising, considering stereoscopic vision and audition 
behave in exactly this way [52]. However, the contrast necessary to evoke this state 
with dissimilar images is much higher than with similar images [5]. In the network 
(9), the fusion state (Fig. 1(C)) is represented as two disjoint bumps. Therefore 



Uix) 



1 



-7r/4+b r-Tr/4+a 
+ / 

7r/4— b J7r/4— a 



Computing the integral terms, we find 



cos(2(a; — y))dy — Iq cos(4a;) -|- /„ sin(2a;). 



, S(x,a) — S(x,b) ^ ... . N 
= l + B ~ ° cos(4a;) + sin(2a;), 



(18) 



where S{x,y) = sin^(a; + y) — sin^(x — y). The solution can be specified by requiring 
the threshold conditions [/(— 7r/4 ± b) = U^n/A ± a) = k are satisfied 



cos(2a)[sin(2a) - sin(26)] 

cos(26)[sin(26) - sin(2a)] 
1+^ 



+ Iq cos(4o) + la cos(2a) = k, 
+ Iq cos(46) — la cos(26) = K, 



(19) 
(20) 
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which we can solve numerically to relate the asymmetry of inputs la to the half- 
widths a, h of each bumps. In the the case of symmetric inputs, U (x) = — /q cos(4a;), 
and it is straightforward to find the two bump widths explicitly. We will now study 
rivalrous oscillations by simply constructing them using a fast-slow analysis. We can 
also numerically identify the boundary of various behaviors of (1), as shown in Fig. 
3. 

Rivalrous oscillations. Oscillations can occur, where the two bump locations 

trade dominance successively (Fig. 1(B)). As in Lcvelt's proposition (i), increasing 
the contrast of a stimulus leads to that stimulus being in dominance longer. This infor- 
mation is not revealed when the system is stuck in a winner-take- all state. Therefore, 
the introduction of synaptic depression into the system (9) leads to an increase in 
information transfer. We will also examine how well (9) recapitulates Levelt's other 
propositions concerning the mean dominance of percepts. 

To analyze (9) for oscillations, we assume that the timescale of synaptic depression 
T ^ 1, long enough that we can decompose (9) into a fast and slow system [29, 25]. 
Synaptic input u then tracks the slowly varying state of the synaptic scaling term q. 
We also assume that q is essentially piecewise constant in space, in the case of the 
Heaviside nonlinearity (3), which yields 



I-7V/2 

u{x, t)^ cos(2(a; - y))q{y, t)H{u{y, t) - K)dy - Iq cos(4a;), (21) 

J— it/2 



and q is governed by (9b). To start, we will also assume a symmetric bimodal input. 
This way, we can simply track q in the interior of one of the bumps, given gj(<) = 
q{n/4:,t). Assuming a switch has just occurred, where the left bump has escaped 
suppression, to pin down the right, so 

Tqi{t) = l-qi{t), tG{0,T), qi{0) = qo, (22) 

where T is the amount of time each percept is in dominance, and go is the synaptic 
strength within a bump region immediately prior to its shutting off. After the right 
bump escapes dominance of the left bump 

Tqi{t) = l-qiit)-^qiit), t € {T, 2T) . (23) 

Solving (22) and (23) simultaneously, we have 

1 , P 



qo 



1 + /3 1 + 13 
which can be solved explicitly for the dominance time 

> + - 4(1 + I3){1 - go)[(l + l3)qo - 1] 



T = Tln 



2(1 + p)qo - 2 



(24) 



so that we now must specify the value go- We can examine the fast equation (21), 
solving for the form of the slowly narrowing right bump during its dominance phase 

/■7r/4+a(t) 

u{x,t) = qi(t) / cos(2(a; — y))dy — /q cos(4a;) 

J7r/4-a(t) 

= qi{t) [sm^{x + a{t)) - sin^(a; - a{t))] - Iq cos(4a;). (25) 
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We can solve for the slowly evolving width a{t) of this bump by requiring the threshold 
condition u{-k/A ± a{t), t) = k to yield 



sin(4a(t)) + Iq cos(4a(t)) = k, 



and then solving using trigonometric identities 



a{t) 



tan 



9.W + N/g.(t)^+4(Jg-/,2) 
2(/o + k) 



(26) 



We can also identify the maximal value of qi (t) = qo which still leads to the right bump 
suppressing the left. Once qi{t) falls below qo, the other bump escapes suppression, 
flipping the dominance of the current bump. This is the point at which the other 
bump of (25) rises above threshold, as defined by the equation 

u(— 7r/4, t) = Iq — qo sin(2ao) = k. 

Combining this with the equation (26), we have an algebraic equation for qo given 



{lo - «)' 



„2 ^Qo 



4(/o' 



2qoVq[+Wl^ 



8/2 + 8IoK + 2ql + 2qo^ql+A{Il-K^y 
which is straightforward to solve for 



90 



2/ox/(/o-«)(3Jo + k) 

3/o + K 



(27) 



and we have excluded an extraneous negative solution. Interestingly, the amplitude of 
synaptic depression is excluded from (27), but we do know based on (22) and (23) that 
qo G ([1 + /3]~^, 1). This establishes a bounded region of parameter space in which we 
can expect to find rivalrous oscillations, which we use to construct a partitioning of 
parameter space in Fig. 3. We can also now approximate the dominance time using 
(24) with (27), as shown in Fig. 2(D). 

In the case of an asymmetric bimodal input (/„ > 0), we can also solve for explicit 
approximations to the dominance times of the right Tr and left Tl populations. 
Following the same formalism as for the symmetric input case 



TR = Tln 
Tl =Tln 



p+qd+ viP + - 4(1 +m- + 

2(l + /3)to-2 

/3 - gd + ViP - qaY - 4(1 + /3)(1 - gfl)[(l + /3)^r^ 
2(l + /3)gi-2 



(28) 
(29) 



where = (1 + (3){qR — qh)-, in terms of the local values qL and qa of the synaptic 
scaling in the right and left bump immediately prior to their suppression. Notice in 
the case q^ — qn, then — and (28) and (29) both reduce to (24). We now need to 
examine the fast equation (21) to identify these two values. This is done by generating 
two implicit equations for the half-width an and qa at the time of a switch 



(IR 



sin(4aH) + Iq cos(4a77) + /„ cos(2ai?) 

lo- la - Qr sin(2aK) 



(30) 
(31) 
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Fig. 2. Dependence of rivalry on the amplitudes of the bimodal input (4)- (A) Dominance 
times are both T ft: Is when input is symmetric (Iq = 0.84, la = 0). (B) Dominance time of right 
input (Tji ^ 0.9s) is longer than left (Ti^ ^ O.Gs) for asymmetric input (In = 0.9, II = 0.84). 
Notice Tji is unchanged from case (A). (C) Dominance times are both shorter of higher contrast 
symmetric stimulus (Iq = 0.9, la = 0). (D) Increasing the strength of the symmetric (la = 0) 
bimodal input (4) decreases the dominance time T of both populations. Our theory (black) computed 
from fast-slow analysis (24) fits results of numerical simulations (blue) well. (E) For asymmetric 
inputs (la 0), we find that varying Ir = lo + la while keeping II = lo ~ la fixed changes the 
dominance times of the left percept Tj^ (blue) much more than that of the right percept Tjj (red). 
Other parameters are k = 0.5, /3 = 1, and r = 50. 



which we can solve expUcitly for 



and 



qR = 



cos 



2h 



2/o(/l - >i) 



(32) 



(33) 



where II = Iq — la is the strength of input to the left side of the network. Likewise, 
we can find the value of the synaptic scaling in the left bump immediately prior to 
its suppression 



QL = 



2Io{Ir-h) 
^{3Io + k){Io-k)' 



(34) 



where /fl = la + la is the strength of input to the right side of the network. Using 
the expressions (33) and (34) we can now compute the dominance time formulae (28) 
and (29), showing the relationship between inputs and dominance times in Fig. 2 
(E). Notice that all of Levelt's propositions are essentially satisfied. Changing the 
strength of the right stimulus In has a very weak effect on the dominance time of the 
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Fig. 3. Partitions of parameter space into various stimulus induced states of (9). (A) Plotted 
as a function of network threshold k and strength Iq of the bimodal input (4) when f} = i. (B) 
Plotted as a function of synaptic depression strength 13 and strength Iq of bimodal input (4) when 
K = 0.5. Parameter t = 50. 



right percept. However, it does increase the overah alternation rate and decrease the 
proportion of time the left percept remains in dominance. 

We also note there is a critical strength of synaptic depression j3 below which 
rivalrous oscillations do not occur. When synaptic depression is sufficiently strong, 
the winner-take- all state ceases to exist. Beyond this critical synaptic depression 
strength, the network either supports rivalrous oscillations or a fusion state. Either 
way, information is conveyed to the network that would otherwise be kept hidden. We 
show this in Fig. 3(B). In this way, synaptic depression can improve the information 
transfer of the network (9). In fact, we will show that it does so in a way that is much 
more reliable than noise. 

3.2. Purely stochastic switching in the ring model. We will now study 
rivalrous switching brought about by fluctuations. In particular, we ignore depression 
and examine the noisy system 



du{x, t) = 



-u{x,t)+l w{x - y)f{u{y,t))dy + I{x) 

-Tv/2 



dt + d^{x,t). (35) 



where {^{x, t)) ~ and {^{x, t)£_{y, s)) — eC{x — y)5[t — s) defines the spatiotemporal 
correlations of the system. 

To start, we can simply consider (35) in the absence of noise 

.7r/2 

ut{x,t) ^ -u{x,t) + w{x - y)f{u{y,t))dy + I{x). (36) 

J-Tl/2 

This model has been studied extensively [2, 3], so we will not perform an in depth 
analysis of stationary bump solutions. We are interested in the winner-take-all state. 
For a cosine weight (5), Heaviside firing rate (3), and bimodal stimulus (4) these can 
be computed as a special case of (15) and lie at either x — ±7r/4, so 

U±{x) = sin(2a±) sin(2.T) - Iq cos(4a;) ± la sin(2a;), (37) 
10 



T (seconds) T (seconds) T (seconds) 

Fig. 4. Purely noise induced switching of dominance in the depression-free ring model (35) 
(A) Numerical simulations of the system for the various input strengths of the symmetric (la = 
bimodal input (4) with (A) Iq = 0.8, (B) Iq = 0.9, and (C) lo = 1.0. Distributions of dominance 
times computed numerically (blue bars) with the exponential distribution (40) with numerically com- 
puted mean (T) (red) superimposed for (D) Iq = 0.8 has (T) 1.2s, (e) Iq = 0.9 has (T) 0.70s, 
and (f) Iq = 1.0 has (T) 0.45s. Other parameters are k = 0.5 and e = 0.04. 



and we can apply the threshold condition [/±(±7r/4 + a±) = k, so 

^ sin(4a±) + Iq cos(4a±) ± la cos(2a±) — k. (38) 
In the case of a symmetric input, a± = a and we can solve (38) explicitly 

(39) 



1 _i 
a — - tan 



l + v/l + 4(/g-^2) 
2(/o + k) 



Since we have no synaptic depression in the model (36), we cannot rely on deter- 
ministic mechanisms to generate switches between one winner-take-all state and an- 
other. Therefore, we will consider the effects of introducing a small amount of noise 
(0 < e <C 1), reflective of synaptic fluctuations. We focus on the spatial correlation 
function C{x) = cos(a;). Noise can generate switches in between the two dominant 
states (Fig. 4). As the strength of both input contrasts are increased, switches be- 
tween percepts occur more often. In fact, there is an exponential dependence of the 
mean dominance time (T) on the strength Iq of the bimodal input (4). We will pro- 
vide an argument, using energy methods, as to why this occurs in our analysis of the 
simplified system (6). 

We will study switching in the case that the suppressed bump comes on and 
shuts off the bump that is currently on. This mechanism is known as escape [48] . As 
shown in Fig. 4(A-C), dominance times decrease with input strength on average. 
Here escape is a noise induced effect [37], rather than a deterministic depression- 
induced effect [43, 25]. However, as opposed to depression-induced switching, there is 
a substantial spread in the possible dominance times for a given set of parameters (Fig. 
4(D-F)). Thus, by examining two dominance times back to back, an observer would 
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Fig. 5. Sampling of the strength of each input based on noise-induced transitions. (A) Mean 
dominance time (T) as a function of the strength Iq of the symmetric (la = 0) bimodal input (4-)- 
(B) Probability that the right input Iji was higher than the left output 7^, based on the sampling 
n cycles (2n switches between percepts), in the case of symmetric inputs II = Ir = 0-9- Notice 
it takes close to 2000 cycles before p[Ir > /i|T*(n)] 0.5. Other parameters are k = 0.5 and 
e = 0.04. 



have difficulty telling if the input strengths were roughly the same or not. Recently, 
psychophysical experiments have been carried out where an observer must identify 
the higher contrast of two rivalrous percepts [36] , showing humans perform quite well 
in at this task. Results of [36] suggest humans most likely use Bayesian inference 
in discerning information about visual percepts. This is in keeping with previous 
observations that humans' visual perception of objects likely carries out Bayesian 
inference [24]. 

Notice in Fig. 5(B) that the likelihood an observer assigns to In > II approaches 
1/2 as the number of observations n increases. We compute p[Ir > /L|T*(n)], the 
probability an observer would presume Ir > Ir conditioned on dominance time pairs 
from n cycles T*{n) = |r]j^\r|,^',r]j^\r^^\ ...,T^"\t}"^|, numerically here. How- 
ever, as the number of cycles n — > cx), the exponential distributions approximately 
defining the identical probability densities pr(Tr) = pl(Tl) = p{T) will be fully 
sampled. We can calculate 

/•OO t^X -1 /"OO t^X -1 

p{Ir > Il\T*{^)) - p{x)p{y)dyAx = Jf^ e'^^+y^ ' dydx = -, 

as in Fig. 5(B). In the case where depression, in the absence of noise, drives switches, 
we would expect this limit to be approached much more quickly. We will also examine 
how a combination of depression and noise affects an observer's ability to discern 
contrast differences. 

We explore this further in the case of asymmetric inputs, showing that dominance 
times are still specified by roughly exponential distributions as shown in Fig. 6. When 
Ir > II, even though the means satisfy (Tr) > {Tl) (Fig. 6(E)), the exponential 
distributions p{Tr) and p{Tl) have considerable variance, also given by the means. 
Therefore, randomly sampled values from these distributions may satisfy Tr < T^. 
Were an observer to use one such sample as a means for guessing the inputs that 
generated them, they would guess Ir < II, rather than the correct Ir > 1^. In terms 
of conditional probabilities, we can expect situations where p{Ir > Il\T*(7i)) < 1/2 
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Fig. 6. (A) Single realization of the stochastic neural field (35) with asymmetric (la > Oj 
inputs In = 0.92 and II = 0.88, leads to longer dominance times for right percept Tn. (B) 
Expected likelihood p[Iji > Iii\T*{n)] that the right input In is stronger than left Ij^ based on n 
comparisons of dominance times Tr and Ti^ sampled. Blue line is theoretical prediction (41) of the 
limit as n oo. Numerically computed dominance time distributions (blue bars) are well fit by the 
exponential distribution (4-0) for the (C) left ({Ti) 0.5sj and (D) right ({Tn) Is) percepts. 
(E) Dependence of mean dominance times (Tn) and {Ti,) on the strength of the right input In. 
Black curves are best fits to exponential functions of In- (F) Expected likelihood p[I n > Il\T*{oo)\ 
right input In is stronger than left II in the limit of high sample number n — > oo, as computed 
theoretically by (41)- Other parameters are k = 0.5, and e = 0.04. 

for finite n, even though In > 1^. We can quantify this effect numerically, as shown 
in Fig. 6(F). In the limit n oo, we find uncertainty continues to creep in, since 
fluctuations continually give an observer misleading information. Since the marginal 
distributions are approximately exponential 

p,(r,)=e-^^/<^^V(T;> J=L,R, (40) 
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Fig. 7. Switching in the stochastic ring model with depression (1) with asymmetric inputs 
(la > Oj. (A) Single realization for asymmetric inputs with In = 0.92 and Ij^ = 0.88, which 
leads to right percept dominating longer, (B) Distribution of left percept dominances times PhiTi,) 
over 1000s is well fit by a gamma distribution (42). (C) Distribution of right percept dominance 
times Pr{T[i) across 1000s is well fit by a gamma distribution (42). Other parameters are k = 0.5, 
/3 = 0.2, r = 50, and e = 0.01. 



we can approximate the conditional probability 



/>oo i^x 

Jo Jo 



'■%--/>TH>g-y/>T^>dyda; 



{Tb))Tl)Jo Jo 
= 1- = (41) 

Observe that the approximation we make using the formula (41) accurately estimates 
the limit p{Ib. > Il\T* (oo)) as shown in Fig. 6(B). This is the likelihood that an 
observer performing Bayesian sampling of the probability densities (40) will predict 
Ir > II. Recent psychophysical experiments suggests humans would perform this 
task of contrast differentiation in this way [24, 36]. 

We see from our analysis that when switches are generated by noise, rather than 
deterministic depression, the means dominance times still obey Levelt's propositions 
to some extent (Fig. 6(E)). This would allow for an accurate comparison of input 
strengths Ir and II based on the means {Tr) and (Tl). However, when consider- 
ing a more realistic observer, that could only compare successive dominance times, 
accurately discerning the comparison of the input contrasts is more difficult. This be- 
comes much more noticeable when the input contrasts are quite close to one another, 
as in Fig. 6(F). We will explore now how introducing depression along with noise 
improves discernment of the input contrasts by an observer using simple comparison 
of dominance times. 

3.3. Switching through combined depression and noise. We now study 
the effects of combining noise and depression in the full ring model of perceptual 
rivalry (1). Numerical simulations of (1) reveal that noise- induced switches occur 
robustly, even in parameter regimes where the noise-free system supports no rivalrous 
oscillations, as shown in Fig. 7. Rather than dominance times being distributed 
exponentially, they roughly follow a gamma distribution [17, 30] 
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Fig. 8. Comparing the probability densities of dominance times in the stochastic ring model 
with depression (1) for various levels of noise and depression. (A) No depression and e = 0.04 (B) 
Depression strength /3 = 0.2 and e = 0.01. (C) Depression strength f} = 0.4 and e = 0.0025. (D) 
Expected likelihood p[Iji > //{|T*(oo)] the right input In is stronger than the left Ij^ based in the 
limit of an infinite number of samples of the dominance times Tji and Tn for the parameters in (A) 
(pink); (B) (magenta); and (C) (red). Other parameters are t = 50 and k = 0.5. 

As opposed to the exponential distribution, (42) is peaked away from zero at Tj = ka, 
which is also the mean of the distribution. Therefore, two distributions of dominance 
times with different means will be more easily discerned from one another. We show 
this in Fig. 8(B) by superimposing the two distributions from Fig. 7 on top of 
one another. They clearly separate better than in the probability densities of purely 
noise-driven switching, shown in Fig. 8(A). As the strength of synaptic depression is 
increased even further, keeping the mean dominance times (Tr) and {T^) the same, 
probability densities separate even further (Fig. 8(C)). We summarize how this sep- 
aration improves the inference of input contrast difference in Fig. 8(D). As the 
strength /3 of depression is increased and noise is decreased, an observer's ability to 
discern which input was stronger is improved. The likelihood assigned to Ir being 
greater than II is a sigmoidal function of In whose steepness increases with /3. For no 
noise, the likelihood function is simply a step function > 1^), implying perfect 

discernment. 

3.4. Analyzing switching in a reduced model. We now perform similar 
analysis on a reduce competitive network model (6) and extend some of the results for 
the ring model. One of the advantages is that we can construct an energy fimction [20], 
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which provides us with intuition as to the exponential dependence of mean dominance 
times on input strengths in the noisc-drivcn case. In particular, wc will analyze (6) 
where the firing rate function is Heaviside (3), starting with the case of no noise 



ur = -ur + H{Ir- qlUl), iiL = -ul+ H{Il- qnun) (43a) 
tqr = 1 - qr - l3uRqR, tql = 1 - qL - PulQl- (43b) 

First, we note (43) has a stable winnor-takc-all solution in the jth population (j = 
R, L) for Ij > and Ik < 1/(1 + /3) {k ^ j). Second, a stable fusion state exists when 
both Il,Ir > 1/(1 + /5)- Coexistent with the fusion state, there may be rivalrous 
oscillations, as wc found in the spatially extended system (9). To study these, we 
make a similar fast-slow decomposition of the model (43), assuming r ::|> 1 to find 
Uj's possess the quasi-steady state 



Ur = H{Ir - QlUl), Ul = H{Il - QrUr). 



(44) 



so we expect uj = or 1 almost everywhere. Therefore, we can estimate the domi- 
nance time of each stimulus using a piecewise equation for the slow subsystem 



TQj 



1 



9i 



1, 
0, 



R,L. 



(45) 



Combining the slow subsystem (45) with the quasi-steady state, wc can use self- 
consistency to solve for the dominance times Tr and of the right and left popula- 
tions. We simply note that switches will occur through escape mechanism, when the 
cross-inhibition between populations becomes weak enough such that the suppressed 
population's (j) input becomes superthreshold, so Ij = qk- Using (45) as we did in 
the spatial system, we find 



TR = T\n 



TL = Tln 



' P-Id + Via - Id)' - 4(1 - Ir){1 + /3)[(1 + /3)/I^ 
2(1 + I3)Il - 2 

' P + Id + V{I3 + Id)' - 4(1 - + Pm + P)7^^] 
2(1 + /3)Ir - 2 



(46) 
(47) 



where Id = {1 + 13)[Ir — II]- For symmetric stimuli, II = Ir = I, both (46) and (47) 
reduce to 



T = Tln 



(3 + - 4(1 ^ /)(1 + /3)[(1 + P)I - 1] 
2{1 + 13)1 -2 



(48) 



using which we can solve for the critical input strength I above which only the fusion 
state exists 



2 + 13 
2(1 + P 



(49) 



in the case of symmetric inputs. We show in Fig. 9 that this asymptotic approxima- 
tions (46) and (47) of the dominance times match well with the results of numerical 
simulations. Levelt's propositions are recapitulated well. 
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Fig. 9. Dominance times computed adiabatically in the noise-free competitive network with 
depression (43). (A) Plot of dominance times T as a function of the strength of a symmetric 
input Iii = I]^ = Ito the competitive system (43) show fast-slow theory (curve) match numerical 
simulations (dots) very well. (B) Dominance times and Tn as a function of right input Iji 
keeping = 0.6 fixed as computed by theory (curves) in (46) and (47) fits numerically computed 
(dots) very well. Other parameters are /3 = 1 and r = 50. 




Fig. 10. Noise induced transitions in depression-free two population network (50). (A) Single 
realization with Ir = II = I = 0.9 for the right un (red) and left ul (black) population activities. 
(B) Mean dominance time (T) as a function of input strength I computed numerically (red dots) 
and fit to the theoretically derived exponential function (51). (C) Mean dominance times (Tn) and 
{Ti) as a function of the right input strength Ir while If^ = 0.95 is fixed. Other parameters are 
e = 0.01. 



Now, we study noise-induced switching in the competitive network. We can sep- 
arate timescales to study the effects of depression and noise together. To start, we 
consider the limit of no depression /3 — > 0, so that 

ur. = -uji + H{Ir-ul)+^r, (50a) 

UL^-UL + H{lL-UR)+iL, (50b) 

where are independent white noise processes with variance e. We show a single 
realization of the competitive network in Fig. 10(A). Most of the time, the dynamics 
remains close to one of the winner-take-all attractors where Uj = 1 and Uk — 
(j = R,L and k ^ j). Occasionally, noise causes large deviations where the suppressed 
population's activity rises above threshold, causing the once dominant population to 
then be suppressed. We plot the relationship between the strength of the inputs 
and the mean dominance times in Fig. 10. Notice, in the case of symmetric inputs 
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Il = Ir = I, we can fit this relationship to the exponential 

(T)«Aexp[B(l-/)]. (51) 

To understand why this is so, we can study the energy function associated with the 
system (50). Notice, this network is essentially the classic two neuron flip-flop Hopfield 
network. As has been shown before [20], for symmetric inputs the energy function for 
this network can be defined 

E[uR, ul] = H{I - ur)H{I -ul)-I [H{I - ur) + H{I - ul)] , (52) 

so we can compute the energy difference between the winner-take-all and fusion states 

E[1,0]=-I, E[l,l] = l-2/, (53) 

respectively. By taking the difference between these two quantities, we find AE = 
1 — 7, which well approximates the exponential dependence of the dominance times T, 
as shown by the fit in Fig. 10(B). This provides the intuition as to this relationship. 

In the same way, we can write down the energy function in the case where the 
inputs are non-symmetric Ir^ II, which is [12] 

E[ur, ul] = H{Il - ur)H{Ir - ul) - IlH{Il - ur) - IrH{Ir - ul), (54) 

which means the energy depth of the right and left winner-take-all states are AEr = 
1 — Il and AEl = 1 — Ir, respectively. Thus, as we observe in Fig. 10(C), we 
expect the dominance time of each population to depend upon the strength of the 
other stimulus according to 

{Tr) « Arb^p [Br{1 - II)] , (Tl) « Al exp [Bl{1 - Ir)] • (55) 

Interestingly, this simple model agrees well with the qualitative predictions of Levclt 
propositions (i-iv) in this high contrast input regime. Now, we will see how including 
synaptic depression in the model generates distributions of dominance times that are 
more similar to those observed experimentally [17, 30, 6]. 

Finally, we show that the network with depression and noise generates gamma 
distributed dominance times, as the spatially extended system does. In addition, we 
provide some analytic intuition as to how gamma distributed dominance times may 
arise in the fast slow system. First, we display as single realization of the network 
(6) in Fig. 11(A) along with a plot of an adiabatically computed energy function 
E[ur,ul, Qr, Ql] for the system. To compute the energy function, we first note that 
in the limit of slow depression recovery time t :$> 1, we can assume the energy of the 
system will be defined simply by (52) augmented by the synaptic scalings imposed by 
qr and ql [34]. In the fully general case, where inputs Ir and II may be asymmetric 
we have 

E[ur, Ul, Qr, Ql] =II{Il - qRUR)H{lR - qlUl) 

Il Ir 
II{Il - QrUr) II{Ir - qlUl). (56) 

Qr Ql 

A similar energy function was previously used in a model with spike frequency adap- 
tation [37]. Here, we are able to derive the energy function from the model (6). 
Therefore, the energy gap between a winner-take-all state and the fusion state will 
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Fig. 11. (A) Single realization of the network (6) with depression and noise. Activity variables 
uji (black) and (blue) stay close to attractors at and 1, aside from depression or noise induced 
switching. Depression variables qji (red) and (green) slowly exponentially change in response to 
the states of un and u^. (B) Probability density p{T) of dominance times T sampled over 1000s, 
well fit by a gamma distribution (42). Parameters are e = 0.036, /9 = 0.2, r = 50, and I = 0.8. 
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Fig. 12. Distribution of dominance times for (A) right and (B) left populations fit with red 
and blue gamma distributions (42) respectively, in the network (6) with depression and noise in the 
case of asymmetric inputs In = 0.82 and = 0.78, sampled over 1000s. The right population has 
longer dominance times. Other parameters are (3 = 0.2, r = 50, and e = 0.036. 



be time-dependent, varying as the synaptic scaling variables qp; and qj^ change. The 
energy difference between the right dominant state and fusion is 



^Enit) = 1 - 



II 
qnity 



AElH) = 1- 



qLity 



(57) 



for the right and left population respectively. 

Notice that dominance times of stochastic switching (Fig. 11) in (6) are dis- 
tributed roughly according to a gamma distribution (42). Superimposing the prob- 
ability density of right (left) dominance times on the left (right) probability density, 
we see they are reasonably separated. Using the analysis we performed for the spa- 
tially extended system, we could also show that depression improves discernment of 
the input contrast difference. Mainly here, we wanted to provide a justification as 
to the relationship between input strength and mean dominance times. Using en- 
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Fig. 13. Perceptual tristability. Examples of images with three possible interpretations. 
(A) Three overlapping gratings. Redrawn with permission from [38]. (B) 'Mother, father, and 
daughter.' Redrawn with permission from [16]. Staring at tristable images for long enough leads 
to the perception switching between the three possible interpretations. (C) Numerical simulation of 
showing the activity variables iti, U2, u^ and the second synaptic scaling variable q2 (cyan) of the 
three population network (58) driven by symmetric stimulus I = 0.7. (D) Relationship between the 
strength of the stimulus I and the dominance times T computed using fast- slow analysis (black) and 
numerics (red dots). Other parameters are /3 = 1 and r = 50. 



ergy arguments, we have provided reasoning behind why Leveh's propositions are 
still preserved in this model, when noise is included, even when switches are noise- 
induced. Increasing one input leads to a reduction in the energy barrier between the 
other population's winner-take-all state and the fusion state. This leads to the other 
population's dwell time being shorter. 



3.5. Switching between three percepts. Finally, we will compare the trans- 
fer of information in competitive networks that process more than two inputs. Re- 
cently, experiments have revealed that perceptual multistability can switch between 
three or four different percepts [16, 9, 38, 22]. In particular, the work of [38] charac- 
terized some of the switching statistics during the oscillations of perceptual tristabil- 
ity. Fig. 13(A,B) shows examples of tristable percepts. Since dominance times are 
gamma distributed and there is memory evident in the ordering of percepts, the pro- 
cess is also likely governed by some slow adaptive process in addition to fluctuations. 

We will pursue the study of perceptual tristability in a competitive neural network 
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model with depression and noise. In the case of three different percepts, a Heaviside 
firing rate (3), and symmetric inputs Ii = I2 = I3 = I, we study the system 



"ill = -ui + H{I - q2U2 - qsua), (58a) 

U2 = -U2 + H{I - qiui - qsua), (58b) 

U3 = -U3 + H{I-qiUi-q2U2), (58c) 

TQj = l-qj- Pujqj, j = 1, 2, 3. (58d) 



Wc are interested in rivalrous oscillations, which do arise in this network for certain 
parameter regimes (Fig. 13(C)). As per our previous analysis, we perform a fast-slow 
decomposition of our system. In the case of symmetric inputs, we use our techniques 
to compute the dominance time T of a population as it depends on input strength 
I. Our analysis follows along similar lines to that carried out for the two population 
network, where we assume r » 1. We find 

^ , Rl - J)(l + /j) + V{1 +m- ^)[3/(l + /3) + /? - 3] 
^ = 2[(l + ^)/-l] 

which compares very well with numerically computed dominance times in Fig. 13(D). 
While perceptual tristability has not been explored very much experimentally [16, 
38, 22], observations that have been made suggest that relationships between mean 
dominance time and input contrast may be similar to the two percept case [22]. In 
our model, we see that as the input strength is increased, dominance times decrease. 
One other important point is that percept dominance occurs in the same order every 
time (Fig. 13(C)): one, two, three. There are no "switchbacks." We will show that 
this can occur in the noisy regime, which degrades information transfer. 

Now, we seek to understand how noise alters the switching behavior when added 
to the deterministic network (58). Thus, we discuss the three population competitive 



network with noisy depression 

ill = -Ml + H{I - q2U2 - qsus), (60a) 

U2 = -U2 + H{I - qiu-i - q^us), (60b) 

us = -us-\- H{I - qiui- q2U2), (60c) 

rqj = 1- qj - Pujqj i = 1,2,3, (60d) 



where are identical independent white noise processes with variance e. In Fig. 14, 

wc show the noise in (60) degrades two pieces of information carried by dominance 
switches: the switching time and the direction of switching. Notice that as the ampli- 
tude of noise e is increased, the dominance times become more spread out. Thus, there 
is a less precise characterization of the input strength in the network. Concerning the 
direction of switching, we see that the introduction of noise makes "switch backs" 
more likely. We define a "switch back" as a series of three percepts that contains the 
same percept twice (e.g. 1 — >■ 3 — >■ 1). This as opposed to a "switch forward," which 
contains all three percepts (e.g. 1 — >■ 3 — > 2). Statistics like these were analyzed 
from psychophysical experiments of perceptual tristability, using an image like Fig. 
13(A) [38]. The main finding of [38] concerning this property is that switch forwards 
occurred more often than chance would suggest. Therefore, they proposed that some 
slow process may be providing a memory of the previous image. Wc suggest short 
term depression as a candidate substrate for this memory. As seen in Fig. 14, the bias 

21 



(59) 



2 4 6 

/ (seconds) 



i2 



2 4 6 

t (seconds) 



!^ 2 



2 4 6 

/ (seconds) 



i''= 



previous current other 
percept percept percept 

switchback switch -forward 



0.0002 



0.2 0.4 0.6 0.8 1 1.2 1.4 

T (seconds) 



previous current 
percept percept 



other 
percept 



switchback switch-forward 




0.0004 



0.2 0,4 0,6 0,8 1 1,2 1,4 

r (seconds) 



previous current 
percept percept 



0.396 ( ^) 0.604 '(^^ 



switchback switch-forward 




0.2 0.4 0.6 0.8 1 1.2 1.4 

T (seconds) 



Fig. 14. Noise degrades two sources of information provided by dominance switches. (A) In 
the absence of noise, switches always move "forward, " so that the previous percept perfectly predicts 
the subsequent percept. Dominance times accumulate at a single value too. (B) For slightly higher 
levels of noise (^/e = 0.0002j, "switch backs" can occur where the subsequent percept is the same 
as the previous percept. Also, the distribution of dominance times spreads. (C) For stronger noise 
(^/s = 0.0004j. Other parameters are Iq = 0.6, /3 = I, and t = 50. 
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Fig. 15. The probability of a switch being in the forward direction in simulations of (60) 
as a function of the amplitude e of noise. As e increases, network switches behave in more of a 
Markovian way, not reflecting any memory of the previous percept. Therefore, information of the 
previous percept is lost as soon as a switch occurs. 
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in favor of switching forward persists even for substantial levels of noise. The idea 

of short term plasticity as a substrate of working memory was also recently proposed 
in [35]. Our results extends this idea, suggesting synaptic mechanisms of working 
memory may be useful in visual perception tasks, such as understanding ambiguous 
images. In Fig. 15, we show that the process of dominance switching becomes more 
Markovian as the level of noise yje is increased even a modest amount. In the limit 
of large noise, the likelihoods of "switch forwards" and "switch backs" are the same. 

4. Discussion. Mechanisms imderlying stochastic switching in perceptual ri- 
valry have been explored in a variety of psychophysical [17, 30, 6], physiological [31, 5], 
and theoretical studies [33, 29, 37]. Since psychophysical data is widely accessible, 
it can be valuable to use the hallmarks of its statistics as benchmarks for theoreti- 
cal models. For instance, the fact that dominance time distributions are unimodal 
functions peaked away from zero suggests that some adaptive process must underlie 
switching in addition to noise [29, 6, 44]. In addition, [36] recently suggested the 
visual system may sample the posterior distribution of interpretations of bistable im- 
ages. This type of sampling can be well modeled by attractor networks analogous to 
those presented here [37] . Therefore, many dominance time statistics from perceptual 
rivalry experiments can be employed as points of reference for physiologically based 
models of visual perception. New data now exists concerning tristable images showing 
this process also likely is guided by a slow adaptive process in addition to fluctuations 
[38]. 

We have studied various aspects of competitive neuronal network models of per- 
ceptual multistability that include short term synaptic depression. First, we were able 
to analyze the onset of rivalrous oscillations in a ring model with synaptic depression 
[54, 25]. Stimulating the network with a bimodal input leads to winner-take-all so- 
lutions, in the form of single bumps, in the absence of synaptic depression. As the 
strength of synaptic depression is increased, the network undergoes a bifurcation 
which leads to slow oscillations whose timescale is set by that of synaptic depression. 
Each stimulus peak is represented in the network by a bump whose dominance time is 
set by the height of each peak. Thus, synaptic depression reveals information about 
the stimulus that would otherwise be masked by the lateral inhibitory connectivity 
of the network. The inclusion of noise in the network leads to dominance times that 
are exponentially (gamma) distributed in the absence (presence) of synaptic depres- 
sion. Motivated by recent work exploring how visiial perception may exploit Bayesian 
sampling on posterior distributions [24, 19, 36], we considered the simple task of an 
observer trying to infer the contrast of stimuli based on dominance times. We found 
Bayesian sampling of the dominance times discerning input contrast differences better 
as switches become more depression driven and less noise-driven. Thus, short term 
depression improves information transfer of networks that process ambiguous images 
in multiple ways. 

We also used energy methods in simple space-clamped neural network models to 
understand how a combination of noise and depression interact to produce switching 
in competitive neural networks. Using the energy function derived by Hopfield for 
analog neural networks, we justify the exponential dependence of dominance times 
upon input strength in purely noise-driven switching. Studying an adiabatically de- 
rived energy ftmction for the case of slow depression, we also show how depression 
works to reduce the energy barrier between winner-take-all states, leading to the slow 
timescale that defines the peak in depression- noise generated switches. Finally, using 
a three population space-clamped neural network, we analyzed depression and noise 

23 



generated switching that may underhc perceptual tristabihty. We found this network 
also sustained some of the same relationships between input contrast and dominance 
times as the two population network. Also, we found that when switches are gen- 
erated by depression there is an ordering to the population dominance that is lost 
when switches are noise generated. This is due to the ited by short 

term depression [35], so the switching process is non-Markovian due to the inherent 
slow timescale in the background. However, even small amounts of noise can wash this 
mc;niory away. Thus, since rc;cc!nt psychophysical experiments reveal a non-Markovian 
property to percept ordering, this provides further support for the idea that a slow 
adaptive process underlies percept switching. 

Note to analytically study the relationship between dominance times and input 
contrast in the noisy system, we resorted to a simple space-clamped neural network. 
In future work, we plan to develop energy methods for spatially extended systems like 
(35). Such methods have seen success in analyzing stochastic partial differential equa- 
tion models such as Ginzburg-Landau models [14]. Energy functions have recently 
been developed for neural field models, but have mostly been studied as a means of 
determining global stability in deterministic systems [53, 28, 40]. We proposed that 
by deriving the specific potential energy of spatially extended neural fields, it may 
be possible to approximate the transition rates of solutions from the vicinity of one 
attractor to another. In the system (35), there should be some separatrix between 
the two winner-take-all states that must be crossed in order for a transition to occur. 
The least action principle states that there is even a specific point on this separatrix 
through which the dynamics most likely flows [14]. Finding this with an energy func- 
tion in hand would be straightforward would allow us to relate the parameters of the 
model to the distribution of dominance times. This would provide a better theoretical 
framework for interpreting data concerning rivalry of spatially extended images, such 
as those that produce waves [50]. 
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